Malte Buettner

mentions 1 type Person

// recent coverage 1 mentions

2026-05-14

maltebuettner.eu

large-language-models

documentai bbox benchmark

Malte Buettner benchmarked bounding box accuracy for Document AI models using pages from the FlashAttention-3 paper, testing Qwen, Kimi, and Mistral via OpenRouter. The evaluation scored models on cov…

// co-occurs with top 7 entities

ExtractBench (1), ContextualAI (1), OpenRouter (1), Qwen (1), Kimi (1), Mistral (1), FlashAttention-3 (1)